video2dn
YouTube videos tagged Muon Optimizer
Modular Manifolds: Optimization for Neural Networks
Muon: More Efficient LLM Pretraining
Jeremy Bernstein - Depths of First Order Optimization
The Muon Optimizer: How Newton-Schulz Enables 2x Faster LLM Training (AdamW Killer?)
Muon is Scalable for LLM Training (Feb 2025)
How To Find Best Optimizer For Your AI Model & LLM - Lion, Muon, Shampoo, SOAP, AdamW
Muon Optimizer for Dense Linear Layer Explained | Newton-Schulz + Momentum
2502.16982 - Muon is Scalable for LLM Training
Practical Efficiency of Muon for Pretraining
Code Muon in Triton - Be 0.1% AI Researcher, Job At OpenAI, DeepMind, Anthropic, xAI
How To VIBE CODE AI Research Paper - SGD vs Muon Optimizer - Beginners
Muon Optimizer: Matrix - Orthogonal, Rotation, Normal, Muon - EP3 - Road To 0.1% AI Researcher
Muon: Faster LLM Pretraining
Muon Outperforms Adam in Tail-End Associative Memory Learning - Fengzhuo Zhang | ASAP 41
Muon - Part 1
MuonClip Optimizer for LLM Training (used in Kimi 2)
Muon - Part 2
The shape of a manifold optimizer. Muon Optimizer. Modular Manifolds. Deep Learning Theory. GenAI.
LIVE I'm Learning Muon Optimizer (2x Faster LLM Training)
Kimi-K2, Muon, 이건 두번째 레슨, 노토님이 말아주는 Kimi-K2와 Muon
2X Faster AI Training? Unpacking the Muon Optimizer That’s Replacing AdamW
AdamW Optimizer from Scratch in Python
Code, Write & Publish AI Research Paper - Full Course - LLM From Scratch - Muon vs Adam Optimizer
Impossible Muons
This Simple Optimizer Is Revolutionizing How We Train AI [Muon]